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A multi-windowing technique lor threshbolding an image using local image properties 



(54) 

(57) A technique. specHfcaly apparatus and an 
accompanying method, tor accurately thresholding an 
image based on local image properties, specifically lumi- 
nance variations, and particularly such a technique that 
uses multi-windowing tor providing enhanced immunity 
to image noise and lessened boundary artifacts. Specif- 
ically a localized intensity gradient. Gf. j). is determined 
tor a pre-defined window (300) centered about each 
image pixel(ij). Localized minimum and maximum pixel 
Intensity measures. L^ and respectively are also 
determined for another, though larger, window (330) cen- 
tered about pixel(ij). Also, a localized area gradient 
measure. GSf.j). is determined as a sum of «ndividual 
intensity gradients for a matrix of pixel positions (370) 
centered about pixel position (i j). Each image pixel(ij) is 
then classified as being an object pixel, i.e.. black, or a 
background pixel, i.e. white, based upon its area gradi- 
ent GS(i j). and associated L™, and L™, measures. 
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Description 

TECHNICAL F ^l n OF THE INVENTION 

The invention relates to a technique, specifically apparatus and an accompanying method, for accurately thresh- 
olding an image based on local image properties, specifically luminance variations, and particularly one that uses murti- 
windowing lor providing enhanced immunity to image noise and lessened boundary artifacts. 

RACK^RHUND ART 

With digital image processing and digital communication becoming increasingly prevalent today, increasing amounts 
of printed or other textual documents are being scanned tor subsequent computerized processing of one form or another 
and/or digital transmission. This processing may involve, for example, optical character recognition, for converting printed 
characters, whether machine printed or handwritten, from scanned bit-mapped form into an appropriate character set, 
such as ASCII, the latter being more suitable for use with word processing and similar computerized document-process- 
ing taste. 

Scanning a gray-scale document typically yields a multi-bit, typically eight-bit, value fa each pixel in the scanned 
document. The value represents the luminance, in terms of e 256-level gray scale, of a pixel at a corresponding point 
in the document. These pixels are generated, depending upon the resolution of the scanner, frequently at resolutions 
of 200-400 pixels/inch (approximately 80-160 pixels/cm), though with highly detailed images at upwards of 1200 or more 
pixels/inch (approximately 470 pixels/cm}. Consequently, a scanned 8 1/2 by 1 1 inch (approximately 22 by 28 cm) image 
will contain a considerable amount of gray-scale data. Inasmuch scanned text generally presents a written or printed 
characters of some sort against a contrasting colored background, typically white or black print against a white or light 
colored background, or vice versa, the exact luminance value at any one pixel in the text is not as important as whether 
that pixel is either part of a character or the background. Therefore, scanned textual images, or scanned textual portions 
of larger images containing both text and graphics, can be efficiently represented by single-bit pixels in which each pixel 
in a scanned image is simply set to, e.g.. a "one" if that pixel in the original image is part of a character or part of 
foreground information, or to, e.g., a "zero" if that pixel in the original image is part of the image background. To easily 
distinguish the different types of scanned images, a gray-tevel image is defined as one having multi-bit (hence mufti- 
value) pixels, whereas a binary (or bi-level) image is formed of single-bit pixels. Furthermore, since binary images gen- t 
erate considerably less data for a given textual image, such as, e.g.. one-eighth, as much as for an eight-bit gray-scale 
rendering of the same image, binary images are more efficient over corresponding gray-scale images and thus preferred 
for storage and communication of textual images. Binary images are also preferred because of their easy compressibility 
using standard compression techniques, e.g., CCITT Groups 3 or 4 compression standards. 

Gray-scale images are converted to binary images through a so-called thresholding process. In essence/each mutti- 
bit pixel value in a gray-scale scanned image is compared to a pre-defined threshold value, which may be fixed, variable 
or even adaptively variable, to yield a single corresponding output bit. If the mufti-bit pixel value equals or exceeds the 
threshold value tor that particular pixel, the resultant single-bit output pixel is set to a "one*; otherwise if the threshold is 
greater than the multi-bit pixel, then the resultant single-bit output pixel remains at "zero". In this manner, thresholding 
extracts those pixels, such as those which form characters, or other desired objects, from the background in a scanned 
gray-scale image, with the pixels that form each character, or object, being one value, typically that for black, and the 
pixels tor the background all being another value, typically that for white. For ease of reference, we will hereinafter 
collectively refer to each character or other desired object in the image as simply an "object". 

Ideally, the best thresholding process is one which accurately selects all the object pixels, but nothing more, in the 
scanned image and maps those pixels to a common single-bit value, such as, e.g., "one" for black. In practice, noise, 
background shading, lighting non-uniformities in a scanning process and other such phenomena, preclude the use of a 
single fixed threshold for an entire image. In that regard, if the threshold is too low, the resulting image may contain an 
excessive amount of noise in certain, if not all regions; or, H too high, insufficient image detail, again in certain, H not ad, 
regions - thereby complicating the subsequent processing of this imaged Given this, the art recognizes that a preferred 
approach would be to select a different threshold value that is appropriate to each and every pixel in the scanned image. 
In doing so, the proper threshold value is determined based upon local properties of the image, i.e., certain image 
characteristics that occur in a localized image region for that pixel. Hence, the threshold would vary across the image, 
possibly even adapt to changing localized image conditions. 

In general, a common methodology for variable thresholding relies on measuring localized image characteristics, 
such as local intensity contrast (or gradient), local averaged intensity and/or local variance, in a local window centered 
about a pixel of interest and then using these measures to classify image pixels into either object pixels, black, or back- 
ground pixels, white. Here, too, reality diverges from the ideal inasmuch this methodology is complicated, and often 
frustrated, by a need to extract various objects in a wide range of documents but with minimal user intervention, such 
as for purposes of initialization and object identification, and while still yielding a clean background in the thresholded 
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♦„ «n b iectsmayindude.e.g..dim.b^ 
image. In realrty. these objec^^y,ncux^ g. entoedded in a black background. 

such as while objects .n a gray backg ou ~; a ™ fcesholding to provide adequate performance with actual 

Nevertheless, given the overwhehnrng *2S!S««nS #* caches thaVattempt to provide satisiactory 
images, the art has persisted by ^^^^^^^Z. in practice, terS to their utility, 
performance. However, all these approaches *^^^S!SS^ properties, are taught in M. Kernel et 
Pe various approaches -Jf^"^ 

a... "Extraction of ^S^TS^m S»S?hSi a "logical lever approach is based on comparinpa 
^^ocessing. Vol. 55. No 3. M»yJ 993. ^ touT loca | averages in neighborhoods 

gray level ofl^ven pixel or rts smoothed gray level tN nmage » noisy, fe be|aw 

Entered about tour pixels ort ^ ona ^ -mask^sed subtraction-, relies 

all tour local averages, then the given p.xe. « e ^^j^^ 8nd a character/Braphics image. First, most ol 
on considering every pixel m ■ ^mage as a sum mg. a ^ ^ g . ^ The 

the background pixete are detected us ^^^^^ ^S ^es passing through each given pixel with slopes 
.merisappnedtoto^ X-Sw*- 1** which are ^ and 

o, o. n/4. n/2 and 3nM. The .esurt.ng -fnered oinary jmag modrHed by detecting additional back- 

background pixels which are whrte ^^^^^Zty possible character/graphics pixel, the gray level 
ground pixels using a P redeterm.ned stroke w ^^.^^^ 0 ^ pixe te. Lastly, a gray-scale character/graph- 
of Hs background image is ^ original scaled image with resulting 

ics image is obtained b * ^^h^ image. Though the global threshold 

differences then being globally ^« h ^^ 

value itseH is fixed, basing the resuWon £ <"™5£*£2 ^ approache6 would appear to be somewhat immune 

exIrSg dim and/or broken objects in a ^^^'^ 670 (issued t0 r.r.a. Morton et al on September 

Another approa*. as desaib^m 
19 nsssandcwnedbythepresentasswneehe eof) • tee*ack signaL Here, whenever a transition 
value being a sum of a tracked bacKg'C.rxJ vatue. a n«se * ; B reKjefined to momentary 

occurs in the image, such as an edge. "SSSSSSifM value has a reduced noise content, 

modify the threshold value such ^°f^^^^^^s at intensity transitions due to abrupt changes 

I"es1n detecting low contrast «^ete; 468i7 04 (issued to J.C. Stoffel et al on August 28. 1984). 

A further approach ,s described .n u ^®^™^^^et potent which is obtained on a pixel-by-pixel 
Here adaptive thresholding is .mplemented by an ^f^^e This offset potential is used m conjunction 
. basl as a function of white peak and >' a * * a The peak and valley poten- 

vvrth nearest neighbor ^ ^ g'^SSJS. Sentential of that pixel with predetermined minimum 

^e7ea^ £ ^ B ""~ t0 " 

extracting low contrast ^^T^T^!^ specHfcally apparatus and an accompanying method, for accu- 
„ Therefore, a need exists in the art tor a }"*^£***^ s ^ ein a threshold that varies based on local 
ralely and reliably thieshotoing a ^^^SSStSl S —y ar^reduced boundary artifacts, as well as 

^ j^wanriaR in the art through our inventive multi-window 
We have substantially and advantageously overcome the def.aenc.es in the an ug 

thresholding technique. lrica i i2e d intensity gradient. G(i j). is determined for each pixel(i j) in an incoming 

Specifically, throughour^ 

so scanned gray-scale bit-mapped .mage. Th,s gradient ,sde ™ n ~ ro , are detecled ^ an N-by-N pixel window 
" a £ui pixel(ij). Next, minimum intensity. ^ 8 SUm 01 i '* iV '' dUa, 

centered over the image about p.xel(.j) In add-on. a ^^"^JJ? n . 2 - b y. n -2 pixels, centered about position <ij). 
gradients, ^ascerta^f-a^tox ^ ^ ^ primarily ^ 

lach pixelfij) is tm das* *-« tekssociatSu,, and U values. These three parameters are 

«■ uDon its area gradient. GS(i.j). and seconoa ny upon to « centered in various windows, one tor determining 

M ^ s 5SSS«^St. - U- On.*, « 

gradient GS(i j). 
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Using these local measures, each image pixel(i j) is f irst classified as being near an edge or not. In that regard, each 
pixel near an edge is so classified if the local measures therefor, specifically its associated area gradient, exceeds a 
pre-defined threshold value. Otherwise, that pixel is viewed as falling within a locally flat field, i.e., of relatively constant 
gray-scale. Specifically, edge pixels are classified by comparing their intensity values against average intensity values 
for local N-by-N pixel windows therefor; in contrast, pixels in flat fields are classified by comparing their intensity against 
a predetermined intensity value. 

In particular, an image pixel(ij) located in a vicinity of an edge in the image is detected whenever the area gradient, 
GS(ij). for that pixel is high, particularly being larger than a predefined gradient threshold, GT. Once such a pixel is 
found, each pixel on a darker side of the edge is found. This latter pixel being in the center of a local N-by-N, e.g., 7-by- 
7 pixel window is classified as an object pixel, i.e., black, whenever its intensity, Uc is smaller than an average of its 
associated and Lma* values. Alternatively, if the intensity of this latter pixel, Lc. exceeds the average intensity vaiue 
of linh and Lma,. then this particular pixel is classified as being background, i.e., white 

Where the area gradient, GS(i j). for pixel(ij) is less than the gradient threshold, such as where the N-by-N window 
slides along an image region of relatively constant gray-scale, pixel(ij) is classified by simple thresholding, i.e., thresh- 
olding its gray-scale value against a constant predefined threshold value, IT. Here, if the gray-scale value of pixel(i j) is 
less than the threshold value IT. this pixel is dassified as an object pixel, i.e., black; otherwise, this pixel is designated 
as part of the background, i.e., white. 

To enhance image edges and by doing so further increase thresholding accuracy, the intensity values of those pixels 
near an edge are modified. For the preferred embodiment, black pixels are illustratively represented by a value of zero 
and white pixels by a value of "1 Specifically, tor those pixels located on a darker side of an edge and having an intensity 
vaiue thai exceeds the threshold value. !T. the intensity erf each oi these pixels is reduced to a value slightly below 
threshold IT, e.g., to a value IT-. In contrast, the intensity values for those pixels, located on a brighter side of an edge 
and having an intensity value less than threshold value IT, are increased to an intensity value slightly greater than thresh- 
old value IT, e.g.. to value IT*. A high quality binary image can then be produced by merely thresholding the resulting 
gray-scale image against pre-defined threshold value IT. 

Advantageously, through the use of area (summed intensity) gradients, the present invention significantly reduces 
both background noise and boundary artifact generation while accurately extracting objects in a threshokJed image. The 
modified gray-scale image, when subsequently thresholded using a fixed threshold value IT, greatly enhances detection 
of low contrast objects and thus readily produces a high quality binary image. 

RRIFF PFSCRf PTinM OF THE DRAWINGS 

The teachings of the present invention may be readily understood by considering the following detailed description 
in conjunction with the accompanying drawings, in which: 

FIG. 1 depicts a high-level block diagram of document imaging system 5 that embodies the teachings of our present 
invention; 

FIG. 2 depicts the correct alignment of the drawing sheets for FIGs. 2A and 2B; 

FIGs. 2A and 2B collectively depict a high-level flowchart of our inventive multi-windowing thresholding method; 

FIG. 3A depicts a 3-by-3 matrix of pixel location definitions which we use in determining, through a "Sober gradient 
operator, gradient strength for current pixelflj); 

FK3. 3B depicts a graphical representation of an N-by-N neighborhood of pixels centered about a cunent pixel(i j) 
in an image intensity record; 

FIG. 3C depicts a graphical representation of an (N-2)-by-(N-2) neighborhood of pixel6 centered about pixel position 
(i.j) in an image intensity gradient record; 

FIG. 4 graphically depicts intensity profile 400 of an illustrative scan line of an original gray-scale image, such as 
that illustratively on document 10 shown in FIG. 1 ; 

FIG. 5 graphically depicts a modified intensity profile which results after thresholding intensity profile 400 shown in 
FIG. 4 according to the teachings of our present invention; 

FIG. 6 depicts a block diagram of a preferred embodiment of our present invention; 
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shown in FIG. 6. 
ments thai are common to various figures. 

„ in *he ari will Quickly reafee that our invention is applicable tor 
A „er reading the losing desc 'f documentary images containingtine 

use in nearly any "JJ-J - ^1£SSS£m» A* * e ,ma 9 e ori9ina,,y aPpear6 - ? •.•"K*' 

art. both regardless of what that line art "JT^ta (s^D d'ewings. Nevertheless for the purpose of simplifying 

L.ea.cou™ 

rSKS^KU which we co^ef^i^ J ^ ^ ^ 
86 Tor purposes of i.lustration <*»~""^^ 

scale value of "0" and "255-. respective ly For relevant 
^d^ 

invention As shown, system 5 is termed d grayscale «»3^JJ^ ^scanned by scanner 20 to produce muM- 
SS 40 .n operation, document 10 contain,^ via leads 25. to a data input of digital 

S M*a>y W-mapped grayscale .mage date . L Th£ data is ^ ^ ^ ^ ^ 
Snag? Processor which, in turn and to the extert » The rnc^ied grayscale image, as 
SLigray-soale image on ^ 

described in greater detail below, has its edges enhanceo oy p thresholding drcurt 40 which 

fai T^mied grayscale '^^^ ^a pre^dned fixed thresh* value thereby 

comoares each multi-bit image pixel m the modrf«d image « arc 9 produced by circuit 40 is applied. 

via leads 45. as another output b.nary image. Digital ■ "1 unmapped images, processor 30 «s prefer- 

pTes^Untion. To ^"dle Wgh d^ro^ * KSSSSIi eSolownin FIG. 6 and discussed in 

ablv fabricated from dedicaled hardware circuitry nMM«w ™ ' general purpose microprocessors. 

S * ow. Alternatively, this processor can be lorammiTo implement our present invention in soft- 

Sassccfct*™^ su« W en,y ted to produce the re.dsite date 

S^r^ ™veT 9 e f ayS -eby derating -ugh *e 

Method 200 is performed for ^£T2SlE5pM is processed is Wentica. across all such pixds. we 
• ima g e . Since the manner through which ea ch j gr ay scale ma je P ^ where . an£j . are integere 
Merely discuss this routine n the ^^JJ2K gC-scate image. In addition to grayscale image data, 
ddine pixel horizontal and vert.ca ^*^^™™££Zls*** «T and GT. These threshold* are adjusted, 
a user supplies values of two input parameters. ^^'J/"" ' |tHude d dmerinQ images with varying char- 

6 asTescrS bdow. eHher to provide ^^^ a ^STp^e optimum thresholding tor any one type of 
acterid.es. such as contrast and dlunwrton. « Snorming method 200 on all such grayscale '^sepjxels. 
rmaoe and/or scanning condition. In any event, as a \^°^™ ^ h ^ ch pixe | in the resdting image correspond,^ 
Ssholded binary or rncdHied gray scale ^ 

KSfiT S2£ ~ ^nX^er — -cgh scalino or hHermg. as 



£5 



is oeiineu uciw». - 

daSTS » acceded by us, 0. the «<«*«*J i f a f^^Swcenle. e <)8bou. C ».aCJ).A S cc. 1 ec>l»el, 



5 



EP 0 712 094 A2 



GX(i j) and GY(ij), respectively, and for each pixel position (ij) forming the gradient strength. G(i,j) as an absolute sum 
ofGX(ij)andGY(ij): 

GX(ij) = L(h1 j-1) + 2L(i+1 j) + L(i + 1 ( i+1) - L(i-1j«1) • 2L<i-1 j) - L(i-1 j+1) (1) 

GY(ij) - L(i-1 j+1) + 2L(ij+1) + Lfi+l.j+1) - Lfi-1 j-1) • 2L(ij-1) - Lfl+1 j-1) (2) 

G(U)«|GX(ij)|4|GY(i.i)| V O) 

• where: G(i.j) is the gradient strength at pixel position 
(ij);and 

L(U) fe the image intensity, luminance, at pixel 
position (i j). 

The resulting gradient strength value for every image pixel collectively forms a gradient strength record for the entire 
scanned image. 

Once the gradient strength, G(ij), is determined for pxelflj), method 200 advances to step 206. This step, when 
performed, determines the area gradient for pixel position (i j), i.e.. GS(ij). as the sum of the gradient strengths for each 
and every pixel position within an (N-2)-by-(N-2) window centered about pixel position (i j). This window is illustratively 
5-by-5 pixels in size and is shown, as window 370. in FIG. 3C for an N-by-N, illustratively 7-by-7, window shown in FIG. 

> 3B, the latter being window 330, centered about pixel(ij). For purposes of comparison, the periphery of an N-by-N 
window is shown as window 350 in FIG. 3C. Though shown sequentially to simplify illustration, step 245 is generally 
performed at essentially the same time, as symbolized by dashed line 244. as is steps 203 and 206. Step 244 determines 
both minimum and maximum pixel intensity values. and L^, respectively, in an N-by-N window centered about 
pixeKij). 

» Once the area gradient is determined for pixel position (i j). then, through the remainder of method 200, the three 

image measures. GS(i.0 and and L^, associated with this particular pixel position are used to classify pixelfij) 
as an object pixel, i.e.. black, or a background pixel, i.e., white. 

In particular, decision step 208 is performed to assess whether the value of the area gradient for pixel(i j), i.e.. GSfi j), 
exceeds a pre-defined threshold value, GT. or not. This test determines whether pixelfij) lies in a vicinity of an edge in 

7 the scanned image, or not. ff the area gradient is less than the threshold value. GT, then pixel(ij) does not lie near an 
edge. Hence, p»xel(i.j) lies within a localized relatively constant tone, i.e., Hat" field, region of the scan ned grayscale 
image. In this case, method 200 advances along NO path 21 1 emanating from decision block 208 to decision block 215. 
This latter decision block determines whether the gray-scale intensity of pixel(i,j), i.e., Ug, exceeds another pre-defined 
threshold value, IT. If this gray-scale value is less than the threshold, the pixel is classified as an object pixel, i.e.. here 

i black; otherwise, the pixel is classified as a background pixel, i.e., here white. Specifically, on the one hand, if pixel 
intensity, Lc is less than or equal to threshold IT, decision block 215 advances method 200, along NO path 217, to 
decision block 222. This latter decision block determines whether a user has previously instructed the method to produce 
a binary or gray-scale output pixel. If a binary pixel is desired, then decision block 222 advances the method, via YES 
path 225, to block 228 which, in turn, sets an intensity of output pixel(ij) to black, i.e.. zero. Alter natively, if a gray-scale 

> output is desired, then decision block 222 advances method 200, via NO path 224, to block 230. This latter block, when 
performed, sets the intensity of output pixel(i.j) to the intensity of L^, without any modifications made thereto If. on the 
other hand, pixel intensity, Uc. exceeds threshold IT, decision block 215 advances method 200, along YES path 218, to 
decision block 232. This latter decision block determines whether a user has previously instructed the method to produce 
a binary or gray-scale output pixel. If a binary pixel is desired, then decision block 232 advances the method, via YES 

? path 235, to block 237 which, in turn, sets an intensity of output pixel(i.j) to white, i.e., one. Alternatively, if a gray-scale 
output is desired, then decision block 232 advances method 200, via NO path 234, to block 241 . This latter block, when 
performed, sets the intensity of output pixel(i j) to the intensity of L^, without any modif ications made thereto. Once step 
228. 237 or 241 is performed, method 200 is completed for pixel(i j); the method is then repeated for the next image 
pixel in succession, and so forth. 

) Alternatively, if area gradient. GS(i,j) exceeds the threshold value. GT. then pixel (i j) lies in the vicinity of an edge. 

In this case, step 245 is now performed to ascertain the values of and L^, if these values have not already been 
determined, as set forth above. Thereafter, method 200 advances to block 248 which calculates an average pixel intensity 
value, l-evg, occurring within the N-by-N pixel window centered about pixel position (i j). This average intensity value is 
simply determined by averaging the values L^, and L,™. 

Once this average value has been ascertained, method 200 advances to decision block 251 which compares the 
intensity of pixelfij), i.e., Lc. against its associated average intensity value, L^g. When this pixel is situated on a darker 
side of an edge and hence an object pixel, then its associated average intensity value will be greater than or equal to 
its intensity value, L^. In this case, the output binary pixel for pixel position (i j) will be set to black. Otherwise, if pixel(i j) 
is situated near an edge but its average intensity value. L avg . would be less than its intensity value, L^, then this pixel is 
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increase thresholding accuracy. « Mr , vel « ix i^s than or equal to its associated average intensity value. 

In particular, il gray-scale pixel 1 ^ *?!J n ^ Mrf^^^ todecision block 260. This latter decision block 

L then decision block 251 advances method 200. along NO path ^ a 
^Irrnfnes whether a user has previous* Zf^SfSS Sn%£. * «ock 2» «*£ in turn, 

binary Pixe. is desired, then ^yZe out£ is desired, then dedsionbiock 260 

sets an intensity of output SfSo This tetter block when performed, determines whether 

advances method 200. via NO pathHSU £«on I 27 . Tta atte «K ^ P ^ ^ ^ 

pixel intensity, k- exceeds the P'^*^^ 8 *" sets the grayscale output tor 

IT. then method 200 advances, along NO path 272^o block 275 value of threshold IT, then method 
^la^^^ ^ se. »e grayscale output lor pixeJQJ) to a 

valued is slightly less. La, IT-, than ^^hoW IT associated ave rage intensity value. 1^,. thendedsion block 

Alternatively, il gray-scale pixel ^ty ^ f ££££E*S This latter decision block determines whether 
25 1 advances method 200. ^ng £E8 ^path JJ^^TS^XS- ^ M 8 ^ * deS ' fed - 

a user has previously instructed ^ meth« " b ™£ ^ Wock28S which, in turn, sets an intensity of output 
then decision block 280 advances the method/ v* YES pat 280 advances method 200. 

* pixe,(i j) «o white, i-e o-H = J a ^ay-sea ta^jpuM. J^S«3-«-n« whether pixel intensity. U. is less 
via NO path 284. to decs.on block 290.™* totter ^ P ^ or fe ^ to threshold IT. then method 200 
than pr e^ef ined threshold I value len exe^uS s^The gSy-sca.e output for pixelf. j) to the pixel 
advances, along NO path 292. «° ^tock 295 which ^ en ^ „ ^ melhod 200 advances; along YES path 
intensity value U- N ~ - «^"*2* for pixelf. j) to a value that is slightly larger. I.e, IT*. 

* 294. to block 297. which, when executed, sets the ^ ^ than> ie spectrvely. threshold IT is not critical 
r rJSS has been pedormed for p*e,<ii): me^od 200 is 

fhen'repeated for the next ^ge pixelM ""JJ^J^S^ aepicts intensity profile 400 of an illustrative scan 
With this understanding .n mmd, consider FIG. f ^^rap«« yjP in FK3. 1 . As a result of processing 

nne of an ongina. grayscale .mage juch ^*2^^!SS!3«-n in FIG. 5. results. As is readily apparent 
profile 400 through our inventive method modified .mens y p«« o an ^ aJe modrfied. 

Som comparing profile* 400 ar* 500 ^SS^Z *JSf?£ «s intensity is larger than threshoW 

in this regard, when such a pixe ffi ; located a ^ « Q 8n *J than threshold IT. Alternatively, when an image 

value IT. the intensrty of this p.xe. is KS?JS San threshold IT; the intensity of this pixel is 

^kSS'aK 

TO ; en ^^o^ 

As shown, circuit 600 is formed of line deteys 6 II V » e J ? ^ and Mjn deteclor ggg; 

gradient detector 615; sum of ^'^^^^^SoSS'ol inverS 664. and gates 662. 670 and 
averaging circuit 640; muttp lexers ^^f^^^^piGs. 2A and 2B and discussed in detail above. 
«s 675. Circuit 600 collectively ^°™^™J™\ SS2. in parallel. V* leads 605 and 608. to an input of 

,n operation, eight-brt input gray-scale imagei »«els(ij) < arc P specifically series-connected single 

,ine detoys 610 and to an hput of 7^-7 Max and J^J^^I^p!- ^ - aroundimage P ixel(i. D . 
Nne delay elements 610,. 610 2 and 610,. p ov.de a 3-by » £ ne o1 window . Tn e outputs of all three 
Each delay element is tapped at three k>cat,ons tc J^JSdST 61 whteh. through calculation of the Sobe. 
delay elements are routed to respective inputs of grad.ent de tect on arcun e outpM t lea06 6 17. This 

operator, as described above, provides an -«J^^^JJfSSS. sun^s all the gradient intensity values 
gTadient value is applied to an input to ^j^^^S^^ « « « afea S radien1 va,ue " l e " 
? n 8 5*y-5 window centered aboU I p,xe po, on d . ecussed in oetaiI below. This area gradient value 

GS(ij). on this value against predefined gradient threshoW. GT; 

is applied to an .nput ( A ) rt JJ^^^J^^JJ^^. The resulting comparison output, appear.ng on 
the latter being applied to another .npul ( B ] ^ 0 ^2ni^ofVn edge or not. The comparison output is applied as an 

- «• 01 the p f e ^ e,ined va,ues ,T - or ,T+ - 
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Max and Miri detector 635. depicted in FIG. 8 and discussed in detail below, ascertains the maximum and minimum 
pixel intensity values, i.e.. Unax an d Unm. of those pixels contained within a 7-by-7 window centered about pixdfij). 
Resulting L^* and values, appearing on leads 637 and 639, are then averaged by averaging circuit 640. This circuit 
contains summer 642, which merely adds these two values together into a resultant sum, and /2 and delay circuit 646 
which inplements a divide by two operation by shitting the sum one-bit to the right; hence dropping the least significant 
bit. The resultant averaged intensity value is then appropriately delayed within circuit 646 for purposes ol achieving 
proper synchronization within circuit 600. 

If the area gradient value, 6S(i j), exceeds the pre-defined gradient threshold, GT, then the averaged intensity value, 
L avg . used as a threshold level in determining the value of output center pixel(i j); else, if the area gradient value is less 
than or equal to the gradient threshold, then value IT is used as the threshold level in determining the value of output 
center pixel(i j). In this regard, the averaged intensity value, L evg , is applied to one input of comparator 650. The current 
center pixel value, is applied through delay element 630 to another input of comparator 650 as well as to one input 
of corrparator 655. The delay of element 630 is set to assure that the proper corresponding values of Ug and are 
synchronously applied to collectively comparators 650 and 655, multiplexors 680 and 690, and gating circuit 660. The 
fixed threshold value, IT, is applied to another input of comparator 655. Comparator 650 produces a low or high or low 
level output on lead 653 if pixel intensity value is less than, or equal to or greater than, respectively, its associated 
averaged intensity value, Le V g- TOs output level is applied to both one input f 1") of multiplexor 680 and one input of 
AND gate 662. Multiplexor 680 selects the output binary pixel as between two bits generated from using two different 
thresholds, i.e., value IT or L ev g- In that regard, comparator 655 produces a high output level at its A>B output whenever 
the predefined threshold value IT is greater than pixel intensity value 1^. The level at output A>B is routed, via leads 
657, to another input ("0") of multiplexor 680 and to one input of AND gate 670. Multiplexor 680 produces the output 
binary value for pixel(i j)- To do so, the comparison output level produced by comparator 625, indicative of whether the 
area gradient, GS(i.j). exceeds the gradient threshold, GT, is applied as a select signal, via lead 628, to a select fS*) 
input of multiplexor 680. If this select signal is high, thereby indicating that the area gradient exceeds the gradient thresh- 
old, then multiplexor 680 routes the level applied to its "1" input, i.e., that generated by comparator 650, to binary output 
lead 33; else, the multiplexor routes the level then appearing at its "0" input to lead 33. 

Multiplexor 690 routes the center pixel value, L^, or a pre-defined value IT+ or IT- as the output gray-scale value to 
eight-bit output leads 37. Pre-defined values IT+ and IT- along wrtfi center pixel value Uc are applied to different corre- 
sponding eight-bit inputs to multiplexor 690. The output of this multiplexor is determined by the state of two select signals, 
SO and SI: if these signals assume the binary values 2ero, T or "3*. then the multiplexor will route center pixel value 
Uc, value IT+ or value IT-, respectively, to output leads 37. Gating circuit 660, in response to the comparison output 
signals produced by comparators 625. 650 and 655 generates, by simple combinatorial logic, the two select signals, SO 
and Si , applied to multiplexor 690. Specifically, if the area gradient is not larger than the gradient threshold, then, as 
discussed above, gray-scale pxel value Lc. without any modification, will be applied through multiplexor 690, to output 
leads 37. Alter natively, if the area gradient exceeds the gradient threshold, then multiplexor 690 will apply ether value 
IT+ or IT- as a modified gray-scale output value to lead 37 based upon whether center pixel intensity value L<; is less 
than its corresponding average pixel intensity value. L^, or not. Since the operation of gating circuit 660 is serf -evident 
from the above description to anyone skilled in the art, it will not be discussed in any further detail. Values IT- and IT+ 
are merely held in appropriate registers (not shown) and from there applied to corresponding inputs of multiplexor 690. 

FIG. 7 depicts a block diagram of Sum of Gradients Circuit 620 which forms part of circuit 600 shown in FIG. 6. 
Circuit 620, as shown in FIG. 7, is formed of adder 710 and line delays 720, the latter containing four series-connected 
single-line delay elements 720o, 720 1t 720 2 and 72O3. Incoming gradient intensity values are applied, over lead 617 
from gradient detection circuit 615 (shown in FIG. 6), to one input of adder 710, shown in FIG. 7. In addition, the output 
of each of the four line delays 720 is routed over a different one of four feedback leads 715 to a different corresponding 
input of this adder. As a result adder 710 forms partial sums of 1-by-5 gradient values which are applied, as input, to 
delay element 720o. To generate a 5-by-5 sum of the intensity gradients, hence producing en area gradient measure, 
four preceding conesponding 1-by-5 sums are added together within adder 71 0 to a present 1-by-5 sum, with the result* 
ant overall 5-by-5 sum being applied to output lead 622. Use of these delay elements connected in this fashion substan- 
tially reduces the amount of logic that would otherwise be needed to generate twenty-five intermediate sums. 

FIG. 8 depicts a block diagram of 7«by-7 Maximum and Minimum Detector 635 which also forms part of circuit 600 
shown in FIG. 6. As noted above, detector 635 determines the maximum and minimum pixel intensity values, and 
L^. respectively, within a 7-by-7 window centered about pixel position (i.j). This detector is formed of two separate 
circuits: maximum detecting circuit 805 and minimum detecting circuit 840 which extract a pixel value having a maximum 
and minimum intensity value, respectively, of all the pixels contained within the 7-by-7 window centered around pixel(i j) 
and respectively apply those two values to leads 637 and 639. Circuit 805 contains maximum detector 810 and line 
delays 820, the latter being formed of six series-connecled single-line delay elements 820o, 820 1t 820 2 . 82O3, 820 4 and 
82O5. Similarly, circuit 840 contains minimum detector 850 and line delays 860, the latter being formed of six series- 
connected single-line delay elements 860 0 . 860 1t 860 2< 86O3, 86O4 and 86O5. Inasmuch as circuits 805 and 840 function 
is an identical manner with the exception that detector 81 0 locates maximum values while detector 850 locates a minimum 
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■ ~ hoth from the same series ol pixel intensity values applied over lead 608. we w, only discuss creuit 805 m deta,l . 
k ' n The arTw^Hhen readHy appreciate how circuit 840 functions from this discussion, 
those skilled in the art win men rea maximum detector 810. In addition, the 

T^ P "T:Z Z^eT^ZZ™^^^ to a respect^ input o, this 
output of each of the six lint oe ays . <xv maximum pixel intensity value from a 1-by-7 series of pixel 

d ? d ° r ^SetS ™ ^^Z^errJ^O, In conjuncBon with six P^^ne- 

r e ^o^^ 

to oener ate ^t^^Z^ image produced by our invention is^ntroHed 

As one can now a P^J^"^** Tnresnold G j fe set t0 p,operly classify image pixels located in the vicinity 

°! Pixels Sed^n image retTonl of unHorm tone. i.e.. Hat' fields. In that regard, a tow IT value tends to classify 
classify pixels locatea in im^e *y relnl - plv hioh GT fe used, then our invention for all practical 

rr. » •»*-*»*- 

purposes performs srm*» » • » ^ w pb*K by essentrarry solely com- 

the art 

, Nr , ,e-r D .A, APPI tCABII ITV ANP APVANTAQES 

tk _m invention is useful in image scanning systems and particularly for thresholding scanned gray-scale 
The present ^^^^ion advantageously provides more accurate thresholding than that previously 

SLZStoJR ^ erHced immLy JtScKground noise in the scanned image and substantia* 

eliminating boundary artifacts from a thresholded image. 

Claims 

% ZHST^Si wvmo a Jrrrripondino outpu. pM value asso^ed town. ch.»cttm«l rrf .hat the 

cunent plT^Hton M to M input image, a maximum pixel value and a minimum pixel value of the pixel values 
^SS^XS^^^^^ -.ue and the maximum and minimum pixel values, a current one of 

E52^E£^ pw m * < he ^ ima9e into an ^ p,xel va,ue 81 6 correspond,n8 

position in the f irst output image. 

. ^SltSSn D in the input image, an intensity gradient. Gf. j). value for the cunent pixel position such 

%^E2^w»+* *» a p,uraWy 01 co " espondin9 in,ensHy 8rad,ent VB,ues ■ 

d ^ in 1ormfng the first window of intensity gradient values from the plurality of intensity gradient values and. in 
response to said first window, determining said area gradient value. 

„ -n, me thod in claim 2 wherein said first window forming and area gradient determining step comprises the step of 
3 ' ^nZ^Sles of the intensity gradient va.ues in said f irst window to produce the area gradient value. 
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4. The method in claim 3 wherein said thresholding step comprises the steps of: 

comparing the area gradient value. GS(i j). to a pre-defined gradient threshold value. GT. so as to produce 
a comparison signal indicative of whether a current one input pixel located at the current pixel position (ij) in the 
input image lies in a vicinity of an edge in said input image; 

H the comparison signal indicates that said current one input pixel does not Be in the vicinity of the edge, 
thresholding, in response to sakJ comparison signal, said input pixel value at position {i j) in the input image against 
a pre-defined fixed gray-scale threshold value, IT, to yield a corresponding binary output value at pixel position (i J) 
in the f irst output image; and 

H the comparison signal indicates that said one current input pixel does lie within the vicinfty of an edge, 
setting, in response to said comparison signal, said corresponding binary output value to a first or. second state If 
the current one pixel value either exceeds or is less than an average of the maximum and minimum pixel values, 
respectively. 

5. The method in claims 3 or 4 wherein said intensity gradient measuring step comprises the step of producing the 
intensity gradient value for the current pixel position by processing said third window of input pixel values through 
a Sobel operator. 

6. The method in claim 5 wherein the third window is a 3-by-3 window of input pixel values end the Sobel operator 
implements the following equations: 

GX(ij) « MM + 2L(k1 j) + L(k1 j+1) - L(M - 2L(i-1j) - Lfi-1 j+1); 

GY(ij) = L(i-1 + 2L(ij+1) + LP+1 - L(i-1 j-1) - 2L(ij-1) - L(i+1 j-1); 

and 

G(ij)-|GX(ij)|+|GY(ij)| 

where: L(i j) is an input pixel value, in luminance, for 
the pixel position (i j) in the input image. 

7. The method in claim 4 further comprising the steps of: 

producing, in response to said comparison signal and the pixel value (Lc) at the current pixel position (i j), a 
modified gray-scale output pixel value at position (i j) in a second output image, wherein. H the comparison signal 
indicates that the one current input pixel does not lie in the vicinity of the edge in said input image, the modified 
gray-scale output pixel value is set equal to the pixel value Lc. or if the comparison signal indicates that the current 
one input pixel does lie in the vicinity of the edge, the modified gray-scale output pixel value is set equal to a pre- 
defined value greater than or less than the threshold value IT when the pixel value Lc is greater or less than, respec- 
tively, the threshold value IT ; and 

thresholding the modif ied gray-scale output pixel value, against the threshold value IT. to yield a corresponding 
second binary output pixel value at a pixel position (ij) within a second output image. 

8- Apparatus for thresholding an input gray-scale image into a first output image, wherein said input image is formed 
of input pixels each having a multi-bit gray-scale pixel value associated therewith, and wherein said f irst output image 
is formed of pixels each having a corresponding output pixel value associated therewith, characterized in that the 
apparatus utilizes the steps recited in any of claims 1 -7. 



10 



EP 0 712 094 A2 



DOCUMENT 




SCANNER 



20 



GREY SCALE 
IMAGE 

DATA 



30 



DIGITAL IMAGE 
PROCESSOR 

(THRESHOLDING & 

GREY SCALE 
MODIFICATION) 



l f BINARY 
IMAGE 



37- 



MODIFIED 
GREY SCALE 
IMAGE 



SIMPLE ^ 
THRESHOLDING 

CIRCUn 



T 

40 



T 

45 



BINARY 
IMAGE 



FIG. 1 



EP 0 712 094 A2 



r 



i 
i 

! 
I 

Y 

l 





Q 

L 

I 


0 START 

GREY SCALE 
, IMAGE DATA, I 


203 


CALCULATE 


GRADIENT 




STRENGTH FOR 




PIXEL (i,j) 









206. 



CALCULATE THE 
SUM OF GRADIENT 
STRENGTH (GS) 
IN (N-2) X (N-2) 
WINDOW CENTERED ON 
PIXEL (i,j) 



— 244 



200' 



• DETECT Lmox 
j Ofld L mi n in 

N x N WINDOW 
CENTERED ABOUT 
PIXEL (i,j) 



230 
_1 



FIG. 2A 



SET 

INTENSITY OF OUTPUT 
PIXEL (ij) TO 
INTENSITY OF 
PIXEL IN CENTER 
OF WINDOW, Lc 





EXIT 



i SET 
INTENSITY 
OF OUTPUT 
PIXEL 
TO 

22B — BLACK 



END 



—245 



!fig. 

I 2A 



i ! 



j FIG. 
;2B 



CALCULATE 

(Lmox + Lmin) 
Lovg = : 



234— 


232 


SET 




INTENSITY OF 


PIXEL TO 


j L C 

I 





235 i SET 

i INTENSITY 
■ OF O'JTFtf 
; PIXEL 
I TO 
237-!, WHITE 



END 



— 241 



FIG. 2 



L 



245 




END 



12 



EP 0 712 094 A2 



FIG. 2B 




297 — 



SH INTENSITY 
or OUTPUT 
PIXEL (i.j) 

to n+ 




YES 


270 


274 — 




SET INTENSITY 


OF OUTPUT 


PIXEL (i.i) 


TO n- 





SET 

INTENSITY 
OF OUTPUT 
PIXEL 

TO L c 



END 



— 278 



END 



END 



13 



EP 0 712 094 A2 




FIG. 3B 



N 



FIG. 3C 




14 




15 



EP0 712 094 A2 




16 



EP 0 712 094 A2 

V 




FIG. 7 



17 



EP 0 712 094 A2 



■ =3 

55* 



31 



>5 z 



o 
D 



.£3 

3EO 



CO 
OO 



© 



© 



i r 



.© 



OO 



•2 




18 



(19) 




(12) 



EuiopSisches Patenlamt 

Eaop »P.e B «« EP 0 712 094 A3 

Office europeen des brevets 1 ' .• 

EUROPEAN PATENT APPLICATION 

(51) int. a. 6 : G06T5/00 



(88) Date of publication A3: 

10.07.1996 Bulletin 1996/28 

(43) Date of publication A2: 

15.05.1996 Bulletin 1996/20 

(21) Application number: 95202928.8 

(22) Date ot filing: 30.1 0.1995 

(84) Designated Contracting States: 
DEFRGBIT 

(30) Priority: 10.11.1994 US 338051 

(71) Applicant: EASTMAN KODAK COMPANY 
11 Rochester, New York 14650-2201 (US) 

(72) Inventors: 

• Lee, Yongchun, 
do Eastman Kodak Co. 
Rochester, New York 14650-2201 (US) 



CO 
< 



Bssile, Joseph M., 
c/o Eastman Kodak Co. 
Rochester, New York 1 4650-2201 (US) 
. Rudak, Peter, 
c/o Eastman Kodak Co. 
Rochester, New York 14650-2201 (US) 

(74) Representative: Bllckle, K. Werner, Dipl.-lng. et al 
KODAK AKTIENGESELLSCHAFT 
Patentabteilung 
70323 Stuttgart (DE) 




(57) A technique, specifically apparatus and an 
accompanying method, for accurately thr^holdjng an 
ZZSSZZi on local image properties ^■calylum. 
n ?nS variations, and particularly such a technique thsrt 
Ctes multi-windowing for providing entenc^mmunrty 
to Sane noise and lessened boundary artrfacts. Speed- 
ed Sad intense gradient G(ij) * 
or a pre-defined window (300) centered about each 
maae Pixelfl j). Localized minimum and maximum pxel 
ESS meatres. and U., '^^^ 
determined for another, though larger, window (330) cen- 
ierS about pixel(ij). Ateo. a located area gradient 
m«Su,rGSfin is determined as a sum of .ndwdual 

then classified as being an object p.xel. ..e.. black, or a 
ba^ground pixel, i.e.. white, based upon rts area grade 
2?£w>. and associated L^ and L_ x measures. 



canr scau 

DATA 




90 

JL 



ptoassot 

(IHRtSHOttt* * 
£RH SCAU 
WCKXriCAlDN) 



57— 



BINARY 

naCI 



WDOfftD 

cm scau 



ciRcun 



T 

40 



BIKAJTT 



FIG. 1 



CM 

© 

0. 
LU 



Phmeaby Rank Xeto. (UK) Business Seivie»s 
8.12.4/3* 



EP 0 712 094 A3 




European P««ent 
Office 



EUROPEAN SEARCH REPORT 



Category 



DOCUMENTS CONSIDERED TO BE RELEVAN' 



Ch.oeo of 



US-A-5 268 773 (PARK ET AL.) 

* abstract * 

* column 2, line 57 - column 3, line 27 * 

* column 18, line 62 - column 12, line 6 * 

EP-A-e 279 297 (FUJITSU LTD) 

* column 2, line 14 - line 39 * 

* column 4, line 26 - line 49 * 



,f document with indication, where »pprepriate, 
of relevant passages 



1-8 666T5/98 



rp.A-2 517 G92 (PHI LPS) 

* page 1. line 15 - line 34 ' 

* page 3, line 8 - line 28 * 
page 4. line 7 - line 16 * 



Rcfevant 
to claim 



AppSotkw Nraae 

EP 95 26 292B 



CLASSIFICATION OF THE 
APPLICATION (laldo) 



1-8 



1-8 



The preset* search report has be** drawn up far all da 



THE HAGUE 

CATEGORY OF CITED DOCUMENTS 

I Errio»l*riJ rdrvarl H combined with mother 
* Jocuneot t* lb* »■>« ai«£orj 
: trcheolorjcaJ baci^roaod 
: pon -writ ten disdcsaf* 
: mtrnDcdbic lOctSBeOt 



Dat »f oBT'dto* «f Ike w»t» 



TECHNICAL FIELDS _ 
SEARCHED (latXX*) 



GD6T 



10 Hay 1996 S Gonzalez Ordonez, 0 

: theory t 

; earlier p- . . 

«fi«r the filing date 
: docotacBt cited w the application 
doevtsent died tot other reasons 



L : doorme 

i'Tro^heToTth* s*»7p^tent rwoity, correspoodins 
docs to eat 



2 



